Electronic document processing method and device

ABSTRACT

In the method and device, electronic documents are created from a batch of physical documents of varying lengths. The batch comprises sub-batches of groups of multiple multi-page physical documents with separator sheets on each group. Each separator sheet is marked with a page count for any physical documents in its group. The batch is scanned-in and the resulting image pages are used to form the electronic documents. Specifically, the image pages are analyzed in sequence to identify document sheet image pages and separator sheet image pages. When a separator sheet image page is identified, the page count on that separator sheet image page is read and electronic document(s) is/are created from sequential set(s) of document sheet image pages, which follow the separator sheet image page and have the specific page count, until the next separator sheet image page is identified. This is repeated for all separator sheet image pages identified.

BACKGROUND

The method and device disclosed herein generally relate to electronicdocument processing and, more particularly, to automatically creatingmultiple electronic documents from a scanned-in batch of physicaldocuments having varying lengths (i.e., varying page counts).

Many workflows exist in which large numbers of physical documents (i.e.,hardcopies of documents) must be digitized. That is, the physicaldocuments must be scanned-in using an optical scanner in order to createelectronic documents (i.e., digitized documents, softcopies, etc.)corresponding to the physical documents. These electronic documents canthen be transmitted to an electronic records system, where they arefurther stored and/or processed. However, when the lengths of thephysical documents vary (i.e., when the physical documents havedifferent page counts), digitization of the physical documents can be adifficult and time-consuming task. Specifically, when varying-lengthphysical documents are scanned-in in a batch, back end electronicdocument processing must be manually performed in order to separate outthe electronic documents that correspond to the physical documents.Otherwise, such varying-length physical documents must be scanned-inindividually.

SUMMARY

In view of the foregoing disclosed herein is an electronic documentsprocessing method wherein electronic documents are automatically createdfrom a batch of physical documents of varying lengths. Specifically, thebatch comprises sub-batches of groups of multiple multi-page physicaldocuments of varying lengths with separator sheets on each group. Eachseparator sheet is marked with the page count of any physical documentscontained in its group. The batch is scanned-in and the resulting imagepages are used to form the electronic documents. Specifically, the imagepages are analyzed in sequence to identify document sheet image pagesand separator sheet image pages. When a separator sheet image page isidentified, the specific page count on that separator sheet image pageis read and electronic document(s) is/are created from sequential set(s)of document sheet image pages, which follow the separator sheet imagepage and have the specific page count, until the next separator sheetimage page is identified. This is repeated for all separator sheet imagepages identified. Also disclosed herein is an electronic documentsprocessing device.

More particularly, disclosed herein is a method for electronic documentprocessing.

Prior to any electronic document processing according to the method, abatch of media sheets can be formed by a user. This batch can be formedsuch that it comprises a plurality of sub-batches. The sub-batches cancomprise groups of physical documents comprising document sheets,wherein some or all of the groups comprise multiple multi-page physicaldocuments and the multiple multi-page physical documents in any givengroup each have the same page count (i.e., the same length). Thesub-batches can also comprise separator sheets on top of the groups andthese separator sheets can have markings indicating the page countsassociated with the groups. That is, each separator sheet can have amarking indicating the page count of any physical documents contained inits group. The markings on the separator sheets can be handwritten orpre-printed. Optionally, the markings can be in the form of barcodes.Also, optionally, no groups will have the same corresponding page count(i.e., no two groups will contain physical documents of the same length)such that the page counts marked on the different separator sheets willall be different. In any case, this batch can then be scanned-in (e.g.,by the user using an optical scanner with a single page template) toproduce multiple image pages. Since the media sheets within the batchcomprise both the document sheets from the varying-length physicaldocuments contained in the groups as well as the separator sheets on topof the groups, these image pages will comprise both document sheet imagepages corresponding to the document sheets of the physical documents andseparator sheet image pages corresponding to the separator sheets.

In the method, the multiple image pages can be received (e.g., by anelectronic documents processor) and used to form multiple electronicdocuments corresponding to all of the varying-length physical documentsin the batch.

Specifically, to form the electronic documents corresponding to thevarying-length physical documents in the batch, the method can compriseanalyzing each of the multiple image pages in sequence (e.g., as themultiple image pages are received by the electronic documents processor)in order to identify the image pages by type. That is, the image pagescan be analyzed in sequence to identify each image page as either aseparator sheet image page or a document sheet image page. Additionally,as the image pages are being analyzed in sequence, the followingprocesses can be performed. When a separator sheet image page isidentified, the specific page count can be read from a marking (e.g.,from a barcode) on the separator sheet image page. Then, electronicdocument(s) having that specific page count can be created from eachsequential set of document sheet image pages, which follow the separatorsheet image page and similarly have the specific page count. The processof creating electronic document(s) with the specific page count fromset(s) of document sheet image pages that follow a separator sheet imagepage can be performed until a subsequent separator sheet image page isidentified (i.e., until the next separator sheet image page isidentified). Then, the processes of reading the specific page count andcreating electronic document(s) with that specific page count can berepeated for the subsequent separator sheet image page and for eachsubsequent separator sheet image page identified thereafter (i.e., forall separator sheet image pages identified).

The method can further comprise communicating the multiple electronicdocuments (e.g., as they are created or subsequently) to an electronicrecords system for storage and/or further processing. For example, whenthe electronic documents processor is incorporated into a computerizeddevice that is on the same network as the electronic records system, theprocess of communicating the multiple electronic documents can beperformed automatically over that same network as the multipleelectronic documents are created, thereby avoiding any proprietarychanges in the code used by the electronic records system.Alternatively, when the electronic documents processor is incorporatedinto a computerized device that is not on the same network as theelectronic records system, the multiple electronic documents caninitially be stored in a memory of the computerized device.Subsequently, a polling request for any new electronic documents can bereceived by the computerized device from a software agent of theelectronic records system. Once the polling request is received, thesoftware agent can be allowed by the computerized device to download themultiple electronic documents from the memory to the electronic recordssystem, thereby also avoiding any proprietary changes in the code usedby the electronic records system.

Also disclosed herein is a computerized device for electronic documentprocessing.

Prior to any electronic document processing by this computerized device,a batch of media sheets can be formed by user. This batch can be formedsuch that it comprises a plurality of sub-batches. The sub-batches cancomprise groups of physical documents comprising document sheets,wherein some or all of the groups comprise multiple multi-page physicaldocuments and the multiple multi-page physical documents in any givengroup each have the same page count (i.e., the same length). Thesub-batches can also comprise separator sheets on top of the groups andthese separator sheets can have markings indicating the page countsassociated with the groups. That is, each separator sheet can have amarking indicating the page count of any physical documents contained inits group. The markings on the separator sheets can be handwritten orpre-printed. Optionally, the markings can be in the form of barcodes.Also, optionally, no groups will have the same corresponding page count(i.e., no two groups will contain physical documents of the same length)such that the page counts marked on the different separator sheets willall be different. In any case, this batch can be scanned-in (e.g., bythe user using an optical scanner with a single page template) toproduce multiple image pages. Since the media sheets within the batchcomprise both the document sheets from the varying-length physicaldocuments contained in the groups as well as the separator sheets on topof each group, the image pages will comprise both document sheet imagepages corresponding to the document sheets of the physical documents andseparator sheet image pages corresponding to the separator sheets.

The computerized device can comprise at least an input, an output, amemory and an electronic documents processor operatively connected tothe input, output and memory (e.g., over a system bus).

The input can receive the multiple image pages in sequence (i.e., in thesequence within which they were scanned-in) and the electronic documentsprocessor can use these multiple image pages to form multiple electronicdocuments corresponding to all the varying-length physical documents inthe batch.

Specifically, the electronic documents processor can analyze themultiple image pages in sequence (e.g., as the multiple image pages arereceived at the input) in order to identify the image pages by type.That is, the electronic documents processor can analyze the multipleimage pages in sequence to identify each image page as either aseparator sheet image page or a document sheet image page.

The electronic documents processor can further perform the followingprocesses as it analyzes the multiple image pages in sequence. When aseparator sheet image page is identified, the electronic documentsprocessor can read the specific page count can from a marking (e.g.,from a barcode) on that separator sheet image page. Then, the electronicdocuments processor can create electronic document(s) having thespecific page count from each sequential set of document sheet imagepages, which follow the separator sheet image page and similarly havethat specific page count. The electronic documents processor cancontinue to create electronic document(s) with the specific page countfrom set(s) of document sheet image pages that follow a separator sheetimage page until it identifies the next separator sheet image page(i.e., a subsequent separator sheet image page). The electronicdocuments processor can repeat the processes of reading the specificpage count and creating electronic document(s) with that specific pagecount for the subsequent separator sheet image page and for eachsubsequent separator sheet image page identified thereafter (i.e., forall separator sheet image pages identified).

The output can communicate the multiple electronic documents to anelectronic records system for storage and/or further processing (e.g.,as the multiple electronic documents are created or subsequently). Forexample, the computerized device can be on (i.e., connected to) the samenetwork as the electronic records system. In this case, the multipleelectronic documents can be automatically communicated over that samenetwork to the electronic records system as they are created, therebyavoiding any proprietary changes in the code used by the electronicrecords system. Alternatively, the computerized device can be on adifferent network than the electronic records system. In this case, theelectronic documents processor can store the multiple electronicdocuments in the memory as they are created. Subsequently, thecomputerized device can receive a polling request for any new electronicdocuments from a software agent of the electronic records system. Oncethe polling request is received, the computerized device can allow thesoftware agent to download the multiple electronic documents from thememory to the electronic records system, thereby also avoiding anyproprietary changes in the code used by the electronic records system.

These and other features are described in, or are apparent from, thefollowing detailed description.

BRIEF DESCRIPTION OF THE DRAWINGS

Various exemplary methods and devices are described in detail below,with reference to the attached drawing figures, in which:

FIG. 1 is a flow diagram illustrating a method for scanning-in a batchof varying-length physical documents;

FIG. 2 is a diagram illustrating a process of sorting physical documentsinto groups according to page count;

FIG. 3 is a diagram illustrating a process of placing separator sheetswith page count markings on the groups of FIG. 2 to form sub-batches;

FIG. 4 is a diagram illustrating a process of stacking the sub-batchesof FIG. 3 to form a batch;

FIG. 5 is a schematic diagram illustrating an exemplary computerizedscanning device for scanning in the batch of FIG. 4;

FIG. 6 is a flow diagram illustrating a method for performing back-endautomatic processing of multiple image pages received followingscanning-in of a batch of varying-length physical documents according tothe method of FIG. 1;

FIG. 7 is a schematic diagram illustrating communication between anelectronic records system and a computerized device that performsback-end automatic processing of multiple image pages received from acomputerized scanning device;

FIG. 8 is an alternative schematic diagram illustrating communicationbetween an electronic records system and a computerized device thatperforms back-end automatic processing of multiple image pages receivedfrom a computerized scanning device;

FIG. 9 is a flow diagram illustrating an exemplary technique forcreating electronic documents at process 604 of FIG. 6;

FIG. 10 is schematic diagram illustrating a computerized device forperforming back-end automatic processing of multiple image pagesreceived from a computerized scanning device; and

FIG. 11 is a schematic diagram illustrating an exemplary hardwareconfiguration for implementing the disclosed method and device.

DETAILED DESCRIPTION

As mentioned above, many workflows exist in which large numbers ofphysical documents (i.e., hardcopies of documents) must be digitized.That is, the physical documents must be scanned using an optical scannerin order to create electronic documents (i.e., digitized documents,softcopies, etc.) corresponding to the physical documents. Theseelectronic documents can then be transmitted to an electronic recordssystem, where they are further stored and/or processed. However, whenthe lengths of the physical documents vary (i.e., when the physicaldocuments have different page counts), digitization of the physicaldocuments can be a difficult and time-consuming task. Specifically, whenvarying-length physical documents are scanned-in in a batch, back endelectronic document processing must be manually performed to separateout the electronic documents that correspond to the physical documents.Otherwise, such varying-length physical documents must be scanned-inindividually.

More specifically, many businesses handle large amounts of physicaldocuments that must be digitized and communicated to electronic recordssystem for storage and/or processing. For example, healthcare businessestypically receive large amounts of physical documents by mail frompayers, labs, specialists, other clinics, patients, etc. This mail mustbe opened and the physical documents must be digitized and communicatedto an Electronic Medical Records (EMR) system and/or an ElectronicHealth Records (EHR) system for storage in an electronic recordassociated with a specific patient. While all EMR/EHR systems providesome support for receiving scanned-in electronic documents, the supportfor processing batch jobs varies widely across such EMR/EHR systems.

Unfortunately, while some EMR/EHR systems provide users (e.g., recordssystem administrators) with back end options for manually dividing asingle multi-page electronic document into multiple electronic documentsand/or manually combining multiple single-page electronic documents intomulti-page electronic documents, other EMR/EHR systems do not. In thiscase, front end scanning of the physical documents becomes morecumbersome because all electronic documents communicated to the EMR/EHRsystems must already correspond to specific physical documents.Specifically, physical documents must be scanned-in one at a time by anoptical scanner, requiring a document manager employed by the healthcarebusiness to spend a significant amount of time sorting and scanning thephysical documents.

An exemplary workflow for the document manager in this case comprisessorting the physical documents such that any single-page documents areplaced in one stack, any multi-page physical documents are placed inanother stack and alternate documents are rotated 90° within themulti-page physical documents stack so that the document manager caneasily separate them later. Next, the single-page document stack isplaced in the document feeder of an optical scanner and scanned-in as abatch using a single-page network scan template. That is, each page inthe single-page document stack is automatically fed by the documentfeeder through the optical scanner, scanned-in and saved as a singlesingle-page electronic document, such as a TIFF document or PDFdocument, and then transmitted to the an EMR or EHR system.Additionally, each multi-page document is scanned-in individually.Specifically, a first multi-page document is manually pulled off of themulti-page document stack and placed in the document feeder of anoptical scanner. Each page in the multi-page document is automaticallyfed by the document feeder through the optical scanner, scanned-in usingan appropriate multi-page network scan template (given the document pagecount) and saved as a single multi-page electronic document, such as aTIFF document or PDF document, and then transmitted to an EMR or EHRsystem. This workflow is repeated until no more multi-page physicaldocuments remain.

In view of the foregoing disclosed herein is an electronic documentsprocessing method wherein electronic documents are automatically createdfrom a batch of physical documents of varying lengths. Specifically, thebatch comprises sub-batches of groups of multiple multi-page physicaldocuments of varying lengths with separator sheets on each group. Eachseparator sheet is marked with the page count of any physical documentscontained in its group. The batch is scanned-in and the resulting imagepages are used to form the electronic documents. Specifically, the imagepages are analyzed in sequence to identify document sheet image pagesand separator sheet image pages. When a separator sheet image page isidentified, the specific page count on that separator sheet image pageis read and electronic document(s) is/are created from sequential set(s)of document sheet image pages, which follow the separator sheet imagepage and have the specific page count, until the next separator sheetimage page is identified. This is repeated for all separator sheet imagepages identified. Also disclosed herein is an electronic documentsprocessing device.

More particularly, disclosed herein is a method and device forelectronic document processing. In both the method and devicepreliminary processing of physical documents is initially performed toproduce multiple image pages (also referred to herein as digital imagepages or electronic image pages).

Specifically, referring to FIG. 1, in this preliminary processingphysical documents each comprising one or more document sheets (e.g.,sheets of paper) can be received (102). For example, in a typical officeenvironment, a mail clerk can receive hardcopy mail, can open thehardcopy mail and can place the physical documents contained therein inthe inbox of a document manager. The physical documents can then besorted into groups (e.g., by the document manager) based on page count(i.e., based on document length) such that each group contains onlyphysical documents having a specific page count (104).

For example, as illustrated in FIG. 2, at process 104 the physicaldocuments can be sorted into a first group 201 of 1-page physicaldocuments (e.g., see three 1-page physical documents 211 a-c), a secondgroup 202 of 2-page physical documents (e.g., see two 2-page physicaldocuments 212 a-c), a third group 203 of 3-page physical documents(e.g., see two 3-page physical documents 213 a-b), a fourth group 204 of4-page documents (e.g., see two 4-page physical documents 214 a-b), afifth group 205 of 5-page physical documents (e.g., see three 5-pagephysical documents 215 a-c), etc. For illustration purposes, five groups201-205 of physical documents are illustrated. However, it should beunderstood that the physical documents can be sorted into any number ofgroups as necessitated by the different lengths (i.e., different pagecounts) of the multi-page physical documents. Preferably the physicaldocuments can be sorted into at groups 201-205 at process 104 such thatno two groups have physical documents with the same number of pages.Furthermore, optionally, the first group 201 with 1-page documents canbe segregated and processed separately such that the remaining groups202-205 all contain at least one multi-page physical document. In anycase, at least some or all of the groups 201-205 into which the physicaldocuments are sorted will contain more than one multi-page physicaldocument.

Referring again to FIG. 1, after the physical documents are sorted intothe groups at process 104 as described above, separator sheets (alsoreferred to as separators, divider sheets, dividers, slip sheets,inserts, etc.) can be placed on top of each of the groups to formsub-batches for a subsequent scanning-in process and these separatorsheets can further have markings indicating the corresponding pagescounts for the physical documents within the groups (106).

For example, as illustrated in FIG. 3, at process 106 separator sheets221-225 can be placed on top of each of the groups 201-205 to formsub-batches 241-245, respectively. Each separator sheet 221-225 can havea marking 231-235 indicating the page count of any physical documentscontained in its group 201-205. That is, a first separator sheet 221 onthe first group 201 can have a first marking 231 indicating that all thedocuments contained therein are 1-page physical documents, a secondseparator sheet 222 on the second group 202 can have a second marking232 indicating that all the documents contained therein are 2-pagephysical documents, a third separator sheet 223 on the third group 203can have a third marking 233 indicating that all the documents containedtherein are 3-page physical documents, a fourth separator sheet 224 onthe fourth group 204 can have a fourth marking 234 indicating that allthe documents contained therein are 4-page documents, a fifth separatorsheet 225 on the fifth group 205 can have a fifth marking 235 indicatingthat all the documents contained therein are 5-page physical documents.

It should be noted that the markings 231-235 can be handwritten, typed,pre-printed, etc. The markings 231-235 can be in the form of text, whichindicates the page counts (e.g., “1-page documents”, “2-page documents”,etc.) and which can subsequently be read, for example, by an opticalcharacter recognition (OCR) tool to determine the page counts (asdiscussed in greater detail below). Alternatively, the markings 231-235can be in the form of coded images, such as barcodes (as illustrated),which indicate the page counts and which can subsequently be read, forexample, by a barcode reader to determine the page counts (as discussedin greater detail below). Optionally, as mentioned above, none of thegroups 201-205 will have the same corresponding page count (i.e., no twogroups will contain physical documents of the same length) and, in thiscase, the page count marking 231-235 on the different separator sheets221-225 will necessarily be different. Also, optionally, the separatorsheets 221-225 can be marked with additional handwritten or pre-printedmarkings (not shown) identifying the separator sheets as such and/or theseparator sheets 221-225 can be larger or smaller than the documentsheets (i.e., the separator sheets and document sheets can havedifferent sizes).

Referring again to FIG. 1, after the sub-batches are formed at process106 as described above, the sub-batches can be stacked to form a batchof media sheets, including the document sheets and separator sheets, asdiscussed above (108). For example, as illustrated in FIG. 4, thesub-batches 241-245 containing the groups 201-205 of physical documentswith separator sheets 221-225 thereon can be stacked to form the batch200. It should be noted that the batch 200 shown in FIG. 4 contains asub-batch containing the first group 201 of 1-page physical documents.As mentioned above, optionally, this first group 201 may be processesseparately such that it would not be contained within the batch 200 andsuch that only groups 202-205 with multi-page physical documents arecontained within the group.

Once the batch 200 of media sheets is formed at process 108, it can beplaced in the batch document feeder of an optical scanner and scanned-inusing a single page template (110)-(112). Specifically, the batch 200 ofmedia sheets, as described in detail above and illustrated in FIG. 4,can be scanned-in at process 112 using any computerized scanning devicethat incorporates or is operatively connected to an optical scannerhaving a batch document feeder.

For example, referring to FIG. 5, a computerized scanning device 500,such as a multi-function printing device, that comprises an opticalscanner 512 having a batch document feeder 513 can be used to scan-in abatch of media sheets, such as the batch 200 including both documentsheets and separator sheets, as described in detail above andillustrated in FIG. 4. The computerized scanning device 500 and,particularly, a multi-function printing device can comprise a pluralityof different functional components. These functional components cancomprise, for example, an optical scanner 512 with a document feeder 513capable of feeding a batch 200 of media sheets through the opticalscanner 512, a print media feeder 514, at least one print mediatransport path 516 incorporating a plurality of print media transportdevices 517, one or more print engines 510 (also referred to herein asprinting devices, marking devices, image rendering devices, etc.) alongthe print media transport path 516, a finisher 530, an input/output 526,a non-transitory computer readable storage medium 520 (e.g., an opticalstorage medium, magnetic storage medium, capacitor-based storage medium,etc.), a graphic user interface (GUI) 536 and a digital front end (DFE)controller 524.

The DFE controller 524 (also referred to herein as DFE processor orsimply as a controller) can comprise one or more computer processingunits (i.e., one or more processors) that can be operatively connectedto and can operatively control (e.g., can be adapted to operativelycontrol, can be configured to operatively control, can be programmed tooperatively control, etc.) the operations of the various functionalcomponents of the multi-function printing device 500, including theoptical scanner 512, print media feeder 514, print media transportpath(s) 516 and transport device(s) 517, print engine(s) 510, finisher530, input/output 526, computer readable storage medium 520, graphicuser interface (GUI) 536, etc. Additionally, the input/output 526 cancomprise a communication port 502 as well as a telephone line/facsimile(FAX) modem 503 to allow for communication between the multi-functionprinting device 500 and external computerized devices (e.g., servers orcomputer hardware systems). For example, the multi-function printingdevice 500 can communicate with external computerized device(s) 550 overa wired or wireless network 525 (e.g., a local area network (LAN), widearea network (WAN), internet, etc.) and/or over the telephone line/FAXmodem 503.

A scan operation can be selected by a user through the graphic userinterface (GUI) 536 and, in response, the document feeder 513 can feed(i.e., can be adapted to feed, can be configured to feed, etc.) mediasheets and, particularly, can feed media sheets in a single monolithicbatch through the optical scanner 512. The optical scanner 512, ascontrolled by the DFE controller 524, can then scan-in (e.g., can beadapted to scan-in, can be configured to scan-in, etc.) the media sheetsin order to produce image pages (also referred to herein as electronicor digital image pages) corresponding to the media sheets. In thepresent method, the user can specifically select, through the graphicuser interface (GUI) 536, the use of a single page template for the scanoperation such that the image pages are produced as discrete single pagedigital documents (i.e., also referred to herein as single pageelectronic documents). The resulting image pages can then becommunicated over the network 525 or telephone line/FAX modem 503 to anexternal computerized device 550 for further storage and/or furtherprocessing.

It should be noted that in the case where the computerized scanningdevice 500 comprises a multi-function printing device the user can also,optionally, select, through the graphic user interface (GUI) 536,production of hardcopies (i.e., printed copies) of the media sheets inthe batch 200. In this case, the print media feeder 514 can feed (e.g.,can be adapted to feed, can be configured to feed, etc.) print media(e.g., paper or other print media sheets) to a print media transportpath 516. The print media transport path(s) 516 and, particularly, theprint media transport devices 517 incorporated therein can transport(e.g., can be adapted to transport, can be configured to transported,etc.) print media through the print engine(s) 510 and to the finisher530. The print engine(s) 510 can print (i.e., mark) the print media asspecified in a print job and can comprise, for example, single color(i.e., monochrome) and/or multi-color print engine(s) that can print(e.g., can be adapted to print, can be configured to print, etc.) insimplex format (i.e., on a single surface of a print media sheet) and/orduplex format (i.e., on opposing surfaces of a print media sheet).Following printing on the print media, the finisher 530 can cut, fold,staple, sort, and/or stack the print media.

Various different types of computerized devices including, but notlimited to, multi-function printing devices, comprise optical scannerswith batch document feeders. Such computerized devices are well known inthe art and can be used to produce electronic image pages. Other typesof computerized devices may comprise digital cameras to produceelectronic image pages. Thus, the details of such computerized devicesare omitted from the specification in order to allow the reader to focuson the salient aspects of the disclosed method and device.

In any case, since the media sheets within the batch 200 comprise bothdocument sheets from the varying-length physical documents and theseparator sheets 221-225 on top of the groups 201-205 of physicaldocuments, the image pages produced from scanning-in the batch 200 ofmedia sheets at process 112 will comprise both document sheet imagepages corresponding to the document sheets and separator sheet imagepages corresponding to the separator sheets.

The method and device disclosed herein relate to back-end automaticprocessing of these multiple image pages to form electronic documentscorresponding to all the varying-length physical documents.

Specifically, referring to FIG. 6, the method can comprise receiving(e.g., by an electronic documents processor from a computerized scanningdevice) multiple image pages (602).

For example, referring to FIG. 7, an electronic documents processor(EDP) 555 can be incorporated into a computerized device 550 (e.g., aserver or computer hardware system), which is external to a computerizedscanning device 500 (e.g., a multi-function printing machine, asdescribed above) and not connected to the same network as thecomputerized scanning device 500. The electronic documents processor(EDP) 555 can receive multiple image pages transmitted over a telephoneline/FAX modem 503 from the computerized scanning device 500 following ascan operation wherein the computerized scanning device 500 scans-in abatch 200 of media sheets, as described in detail above and illustratedin FIG. 4. Alternatively, referring to FIG. 8, the electronic documentsprocessor (EDP) 555 can be incorporated into a computerized device 550(e.g., a server or computer hardware system), which is external tocomputerized scanning device 500 (e.g., a multi-function printingmachine, as described above) and connected to the same wired or wirelessnetwork 525 as the computerized scanning device 500. This electronicdocuments processor (EDP) 555 can receive multiple image pages over thenetwork 525 from a computerized scanning device (e.g., a multi-functionprinting machine 500) following a scan operation wherein thecomputerized scanning device 500 scans-in a batch 200 of media sheets,as described in detail above and illustrated in FIG. 4

In either case, multiple image pages can be received by the electronicdocuments processor (EDP) 555 in the sequence in which they werescanned-in (602). Once received that can, optionally, be stored in amemory 556 of the computerized device 550).

Next, the multiple image pages can be used (e.g., by the electronicdocuments processor (EDP) 555) to automatically form multiple electronicdocuments corresponding to all of the varying-length physical documentsin the batch 200 (604). Again, as mentioned above, since the mediasheets within the batch 200 comprise both document sheets of thevarying-length physical documents and separator sheets 221-225 on top ofthe groups 201-205 of physical documents, the image pages received bythe electronic documents processor (EDP) 555 will comprise both documentsheet image pages corresponding to the document sheets and separatorsheet image pages corresponding to the separator sheets.

In order to form electronic documents corresponding to the physicaldocuments in the batch 200 at process 604, each of the multiple imagepages can be analyzed in sequence (e.g., as the multiple image pages arereceived by the electronic documents processor (EDP) 555 or at somelater time) in order to identify the image pages by type (606). That is,the image pages can be analyzed in sequence to identify each image pageas either a separator sheet image page or a document sheet image page.Identification can be made by the electronic documents processor (EDP)555 based, for example, on the presence or absence of a page countmarking (e.g., the presence or absence of a handwritten page countmarking or a pre-printed page count marking, such as a pre-printed pagecount barcode) on the image page. Alternatively, the identification canbe made by the electronic documents processor (EDP) 555 based, forexample, on the presence or absence of an additional handwritten orpre-printed marking on the image page. Alternatively, the separatorsheet image pages and document sheet image pages may be different sizesand, thus, can be distinguished based on these different sizes.

Additionally, the following processes can also be performed by theelectronic documents processor (EDP) 555 as the image pages are beinganalyzed. Whenever a separator sheet image page is identified, thespecific page count can be read from a marking (e.g., on barcode) on theseparator sheet image page (608). For example, referring again to FIG.3, each marking 231-235 on each separator sheet 221-225 can be in theform of text, which is handwritten or pre-printed and which indicatesthe specific page count for its group 201-205 of physical documents(e.g., “1-page documents”, “2-page documents”, etc.). This text markingcan be read by an optical character recognition (OCR) tool of theelectronic documents processor (EDP) 555 in order to determine thespecific page count. Alternatively, each marking 231-235 on eachseparator sheet 221-225 can be in the form of a coded image, such as abarcode (as illustrated), which indicates the specific page count forits group 201-205 of physical documents. This coded image or barcode canbe read by a code reader, such as barcode reader, of the electronicdocuments processor (EDP) 555 in order to determine the specific pagecount.

Then, electronic document(s) having the specific page count can becreated from each sequential set of document sheet image pages, whichfollow the separator sheet image page and similarly have that specificpage count (610). This process of creating the electronic document(s)with the specific page count from set(s) of document sheet image pagesthat follow a separator sheet image page can be performed until asubsequent separator sheet image page is identified.

For example, if the specific page count is 3, then a first set of threedocument sheet image pages, which immediately follow the separator sheetimage page, can be extracted and used to create a first 3-pageelectronic document. If the image page immediately following the firstset is a document sheet image page, a second set of three document sheetimage pages, which immediately follow the first set, can be extractedand used to create a second 3-page electronic document. If the imagepage immediately following the first set is a document sheet image page,a third set of three document sheet image pages, which immediatelyfollow the second set, can be extracted and used to create a third3-page electronic document; and so on, until a subsequent separatorsheet image page is identified (i.e., until the next separator sheetimage page is identified).

Then, the processes of reading the specific page count (608) andcreating electronic document(s) with that specific page count (610) canbe repeated for the subsequent separator sheet image page and for eachsubsequent separator sheet image page identified thereafter (i.e., forall separator sheet image pages identified).

FIG. 9 is a flow diagram providing a more detailed exemplary processflow that can be used by the electronic documents processor (EDP) 555 tocreate the electronic documents at process 604. Specifically, an imagepage can be received (902). This image page can then be analyzed (e.g.,using the same techniques as described above) to determine whether ornot it is separator sheet image page (904). If this image page is aseparator sheet image page, a marking on the separator sheet image pagecan be read to determine the page count N (906). After the page count Nis determined, the next image page can be received and a determinationas to whether or not this next image page is a separator sheet imagepage can be made (902)-(904). If this next image page is not a separatorsheet image page, a new electronic document can be created (908) and thenext image page as well as each subsequent image page up to N imagepages total can be added to that new electronic document (910)-(914).Once N image pages total are added to that new document, the newelectronic document is complete and can be closed (916). After the newelectronic document is completed and closed, a determination can be madeas to whether or not any image pages remain (918). If so, the processesdescribed above are repeated (902)-(918) are repeated.

It should be noted that, when the required N image pages have not beenadded to a new document at process 912 and the next image page isretrieved at process 914, the next image page can, optionally, beanalyzed at process 920 before adding it to the new electronic documentat process 910 to verify (i.e., confirm) that it is actually a documentsheet image page as opposed to a separator sheet image page. If the nextimage page is a document sheet image page as expected, it will be addedto the new electronic document at process 910. However, if the nextimage page is not (i.e., if it is a separator sheet image page), the jobcan be automatically flagged as failing a job integrity check (922).Alternatively, a determination can be made as to whether or not thenumber of image pages between a separate sheet image page and the nextseparator sheet image page is divisible by the required page count N. Ifnot, the job can be automatically flagged as failing a job integritycheck. When a job is flagged as failing a job integrity check, a failreport can generated (e.g., automatically or on-demand) and that reportcan be output (e.g., to the device from which the image pages werereceived).

Referring again to FIG. 6, the method can further comprise communicatingthe multiple electronic documents to an electronic records system forstorage and/or further processing either automatically as the electronicdocuments are generated or upon request (612).

For example, as illustrated in FIG. 7, the electronic documentsprocessor (EDP) 555 can be incorporated into a computerized device 550(e.g., a server or computer hardware system) that is on (i.e., connectedto) the same network 575 as an electronic records system 580. In thiscase, the process of communicating the multiple electronic documents canbe performed by the electronic documents processor (EDP) 555automatically over that same network 575 as each new electronic documentis created, thereby avoiding any proprietary changes in the code used bythe electronic records system 580. In other words, electronic recordsystem protocols that already exist to enable scanners to transmitelectronic documents (e.g. TIFF or PDF) directly into the electronicrecords system can be leveraged when the computerized device 550 andelectronic records system 580 are on the same network 575 so that nochanges in the proprietary software of the electronic records system 580is needed.

Alternatively, as illustrated in FIG. 8, the electronic documentsprocessor (EDP) 555 can be incorporated into a computerized device 550(e.g., a server or computer hardware system) that is not on (i.e., notconnected to) the same network as the electronic records system 580. Inthis case, a polling request may be used to initiate on-demandcommunication of the multiple electronic documents to the electronicrecords system 580. Specifically, in this case, the multiple electronicdocuments can initially be stored by the electronic documents processor(EDP) 555 in a memory 556 of the external computerized device 550. Whena polling request for any new electronic documents is received by thecomputerized device 550 from a software agent of an electronic recordssystem 580, the software agent can be allowed by the externalcomputerized device 550 to download the multiple electronic documentsfrom the memory 556 to the electronic records system 580, thereby alsoavoiding any proprietary changes in the code used by the electronicrecords system 580.

Also, referring to FIG. 10, disclosed herein is a computerized device550 (e.g., a server or a computer hardware system) for electronicdocument processing.

It should be noted that prior to any electronic document processing bythe device 550, a batch of media sheets can be formed (e.g., in themanner set forth in the flow diagram of FIG. 1). Specifically, thisbatch 200 can be formed in the manner set forth in the flow diagram ofFIG. 1 so that it comprises a plurality of sub-batches 241-245, asillustrated in FIG. 4. The sub-batches 241-245 can comprise groups201-205 of physical documents comprising document sheets, wherein someor all of the groups 201-205 comprise multiple multi-page physicaldocuments and the multiple multi-page physical documents in any givengroup each have the same page count (i.e., the same length) (e.g., see2-page documents 212 a-c of group 202, 3-page physical documents 213 a-bof group 203, and so on). The sub-batches 241-245 can also compriseseparator sheets 221-225 (also referred to as separators, dividersheets, dividers, slip sheets, inserts, etc.) on top of the groups201-205 and these separator sheets 221-225, as shown in FIG. 3, can havemarkings 231-235 indicating the page counts associated with the groups201-205. That is, each separator sheet 221-225 can have a markingindicating the page count of any physical documents contained in itsgroup 201-205. The markings 231-235 on the separator sheets 221-225 canbe handwritten or pre-printed and can be text or code images (e.g.,barcodes). Optionally, no groups will have the same corresponding pagecount (i.e., no two groups will contain physical documents of the samelength) such that the page counts marked on the different separatorsheets will all be different.

In any case, this batch 200 can then be scanned-in to produce multipleimage pages. Specifically, the batch 200 of media sheets can bescanned-in using any computerized scanning device 500 (e.g., amulti-function printing device as shown in FIG. 5) that incorporates oris operatively connected to an optical scanner having a batch documentfeeder. Since the media sheets within the batch 200 comprise both thedocument sheets from the various physical documents contained in thegroups as well as the separator sheets on top of each group, the imagepages produced from scanning-in the batch of media sheets will compriseboth document sheet image pages corresponding to the document sheets andseparator sheet image pages corresponding to the separator sheets.

The computerized device 550 can comprise a server or computer hardwaresystem. For example, as illustrated in FIG. 11, the computerized device550 can comprise at least one processor or central processing unit (CPU)10, including an electronic documents processor (EDP) 555. The CPUs 10,including the electronic documents processor (EDP) 555, can beinterconnected via a system bus 12 to various device components such asa random access memory (RAM) 14, read-only memory (ROM) 16, and aninput/output (I/O) adapter 18. The I/O adapter 18 can peripheral storagedevices 11 to the system bus 12. The peripheral storage devices 11 cancomprise data and/or program storage devices, which are readable by thecomputerized device 550 and, particularly, the CPUs 10, including theelectronic documents processor (EDP) 555. The CPUs 10, including theelectronic documents processor (EDP) 555, can read data and inventiveprograms stored on the storage devices 11 and can execute theinstructions to perform the disclosed method. Optionally, thecomputerized device 550 can further comprise a user interface adapter 19that connects user input devices 15 (e.g., a keyboard, mouse,microphone, etc.) to the system bus 12. The computerized device 550 canalso comprise a communication adapter 20, which connects the system bus12 to a network 25, and a display adapter 21, which connects the systembus 12 to a display device 23 which may be embodied as an output devicesuch as a monitor, printer, or transmitter, for example.

In any case, referring again to FIG. 10, this computerized device 550can comprise at least an input 557, an output 559, a memory 556 and anelectronic documents processor (EDP) 555 operatively connected to theinput 557, output 559 and memory 556 (e.g., via a system bus 558).

The input 557 can receive (i.e., can be adapted to receive, can beconfigured to receive, etc.) the multiple image pages in sequence fromthe computerized scanning device 500. Optionally, the multiple imagepages can be stored in memory 556 (e.g., see stored image pages 562).The electronic documents processor (EDP) 555 can use these multipleimage pages to form multiple electronic documents corresponding to allof the varying-length physical documents in the batch 200.

The electronic documents processor (EDP) 555 can analyze (i.e., can beadapted to analyze, can be configured to analyze, can read and execute aprogram of instructions 561 stored in the memory 556 in order toanalyze, etc.) the multiple image pages in sequence in order to identifythe image pages by type (e.g., as a separator sheet image page or as adocument sheet image page). It should be noted that the electronicdocuments processor (EDP) 555 can automatically analyze the image pagesin sequence as each image page is received at the input 557.Alternatively, as mentioned above, the multiple image pages caninitially be stored in the memory 556 (see stored image pages 562). Inthis case, the electronic documents processor (EDP) 555 can, in responseto a demand, request or some other triggering condition, access thestored image pages 562 and analyze them in sequence.

Specifically, the electronic documents processor (EDP) 555 can comprisean image analysis (IA) tool 565 that identifies (i.e., is adapted toidentify, is configured to identify, reads and executes a program ofinstructions 561 stored in memory 556 to identify, etc.) an image pageas a separator sheet image page or a document sheet image page based,for example, on the presence or absence, respectively, of a page countmarking (e.g., the presence or absence of a handwritten page countmarking or a pre-printed page count marking, such as a pre-printed pagecount barcode) on the image page. Alternatively, the image analysis (IA)tool 565 can identify (i.e., can be adapted to identify, can beconfigured to identify, can read and execute a program of instructions561 stored in memory 556 to identify, etc.) an image page as a separatorsheet image page or a document sheet image page based, for example, onthe presence or absence, respectively, of an additional handwritten orpre-printed marking on the image page. Alternatively, the image analysis(IA) tool 565 can identify (i.e., can be adapted to identify, can beconfigured to identify, can read and execute a program of instructions561 stored in memory 556 to identify, etc.) an image page as a separatorsheet image page or a document sheet image page based, for example, onthe presence or absence, respectively, based on the different sizes ofsuch image pages.

Additionally, as the electronic documents processor (EDP) 555 analyzesthe image pages in sequence, it can further perform (i.e., be adapted toperform, be configured to perform, read and execute a program ofinstructions 561 stored in memory 556 to perform, etc.) the followingprocesses. When a separator sheet image page is identified, theelectronic documents processor (EDP) 555 can read the specific pagecount can from a marking (e.g., a text marking or coded image marking,such as a barcode) on that separator sheet image page. For example, theelectronic documents processor (EDP) 555 can comprise an opticalcharacter recognition (OCR) tool 563. When the markings 531-535 on theseparator sheets 521-525, as discussed above and illustrated in FIG. 3,comprise handwritten or pre-printed text markings, this opticalcharacter recognition (OCR) tool 563 can recognize (i.e., can be adaptedto recognize, can be configured to recognize, can read and execute aprogram of instructions 561 store in memory 556 to recognize, etc.) thespecific page count on the separator sheet image page. Additionally oralternatively, the electronic documents processor (EDP) 555 can comprisea code reader 564 (e.g., a barcode reader). When the markings 531-535 onthe separator sheets 521-525, as discussed in detail above andillustrated in FIG. 3, comprise coded image markings (e.g., barcodes),this code reader 564 (e.g., barcode reader) can read the coded image(i.e., can be adapted to read, can be configured to read, can read andexecute a program of instructions 561 stored in memory 556 to read,etc.) the specific page count on the separator sheet image page.

The electronic documents processor (EDP) 555 can further create (i.e.,can be adapted to create, can be configured to create, can read andexecute a program of instructions 561 stored in memory 556 to create,etc.) electronic document(s) having that specific page count from eachsequential set of document sheet image pages, which follow the separatorsheet image page and similarly have that specific page count. Theelectronic documents processor (EDP) 555 can create electronicdocument(s) with the specific page count from set(s) of document sheetimage pages that follow a separator sheet image page until it identifiesa subsequent separator sheet image page (i.e., until it next identifiesa separator sheet image page).

For example, if the electronic documents processor (EDP) 555 reads aspecific page count of 3 off a separator sheet image page, then theelectronic documents processor (EDP) 555 can extract a first set ofthree document sheet image pages, which immediately follow the separatorsheet image page, and use that first set to create a first 3-pageelectronic document. If the electronic documents processor (EDP) 555determines that the image page following the first set is a documentsheet image page, the electronics document processor (EDP) 555 canextract a second set of three document sheet image pages, whichimmediately follow the first set, and use that second set to create asecond 3-page electronic document. If the electronic documents processor(EDP) 555 determines that the image page following the second set is adocument sheet image page, the electronics document processor (EDP) 555can extract a third set of three document sheet image pages, whichimmediately follow the second set, and use that third set to create athird 3-page electronic document; and so on, until a subsequentseparator sheet image page is identified (i.e., until the next separatorsheet image page is identified).

The electronic documents processor (EDP) 555 can repeat (i.e., can beadapted to repeat, can be configured to repeat, can read and execute aprogram of instructions 561 stored in memory 556 to repeat, etc.) theprocesses of reading the specific page count and creating electronicdocument(s) with that specific page count for the subsequent separatorsheet image page and for each subsequent separator sheet image pageidentified thereafter (i.e., for all separator sheet image pagesidentified).

It should be noted that the EDP 555 verify (i.e., confirm) (i.e., can beadapted to verify, can be configured to verify, etc.) that image pagesextracted to create new electronic documents are document sheet imagepages as expected given the specified page count. If an image page in agiven set of image pages extracted to create new electronic document isnot a document sheet image page, but rather a separator sheet imagepage, the EDP 555 can automatically flag the job as failing a jobintegrity check. Alternatively, the EDP 555 can determine (i.e., can beadapted to determine, can be configured to determine, etc.) whether ornot the number of image pages between a separate sheet image page andthe next separator sheet image page is divisible by the required pagecount. If not, the EDP 555 can automatically flag the job as failing ajob integrity check.

The computerized device 550 can further comprise an output 559 that cancommunicate (i.e., can be adapted to communicate, can be configured tocommunicate, can read and execute a program of instructions 561 storedin memory 556 to communicate, etc.) the multiple electronic documents toan electronic records system for storage and/or further processing. Suchcommunication can be either automatic as the electronic documents arecreated or can be only in response to a demand, request or othertriggering condition.

For example, as illustrated in FIG. 7, the computerized device 550 canbe on (i.e., connected to) the same wired or wireless network 575 as theelectronic records system 580. In this case, the multiple electronicdocuments can be communicated automatically over that same network 575to the electronic records system as they are created by the electronicdocuments processor 555, thereby avoiding any proprietary changes in thecode used by the electronic records system. In other words, electronicrecord system protocols that already exist to enable scanners totransmit electronic documents (e.g. TIFF or PDF) directly into theelectronic records system can be leveraged when the computerized device550 and electronic records system 580 are on the same network 575 sothat no changes in the proprietary software of the electronic recordssystem 580 is needed.

Alternatively, as illustrated in FIG. 8, the computerized device 550 canbe one (i.e., connected to) a different network than the electronicrecords system 580. For example, the computerized device 550 can be on(i.e., connected to) the same network 525 as the computerized scanningdevice 500 that produces the multiple image pages. In this case, theelectronic documents processor (EDP) 555 can initially store themultiple electronic documents in the memory 556 (e.g., see storedelectronic documents 566). The computerized device 550 can subsequentlyreceive a polling request for any new electronic documents from asoftware agent of the electronic records system 580. In response to thispolling request, the computerized device 550 can allow the softwareagent to download the stored electronic documents 566 from the memory556 to the electronic records system 580, thereby also avoiding anyproprietary changes in the code used by the electronic records system.

It should be noted that in the case where the EDP 555 flags a job asfailing a job integrity check, the EDP 555 can further generate (i.e.,can be adapted to generate, can be configured to generate, etc.) a failreport and the output 559 can communicate that fail report (e.g., to thedevice from which the image pages were received).

It should be noted that in the method and device described above thecomputerized scanning device can be either a simplex or duplex scanningdevice. Those skilled in the art will recognize that, for duplexscanning, the specified page count marked on the separator sheets mustcorrespond to the actual page count and not the document sheet count.Alternatively, for duplex scanning, the specific page count marked onthe separator sheets can correspond to the document sheet count;however, for purposes of extracting the image pages and creating theelectronic documents, this specified page count must be multiplied bytwo.

Many computerized devices are discussed above. Computerized devices thatinclude chip-based central processing units (CPU's), input/outputdevices (including graphic user interfaces (GUI), memories, comparators,processors, etc. are well-known and readily available devices producedby manufacturers such as Dell Computers, Round Rock Tex., USA and AppleComputer Co., Cupertino Calif., USA. Such computerized devices commonlyinclude input/output devices, power supplies, processors, electronicstorage memories, wiring, etc., the details of which are omittedherefrom to allow the reader to focus on the salient aspects of thesystems and methods described herein. Similarly, scanners and othersimilar peripheral equipment are available from Xerox Corporation,Norwalk, Conn., USA and the details of such devices are not discussedherein for purposes of brevity and reader focus.

The terms printer or printing device as used herein encompasses anyapparatus, such as a digital copier, bookmaking machine, facsimilemachine, multi-function machine, etc., which performs a print outputtingfunction for any purpose. The details of printers, printing engines,etc., are well known and are not described in detail herein to keep thisdisclosure focused on the salient features presented. The systems andmethods herein can encompass systems and methods that print in color,monochrome, or handle color or monochrome image data. All foregoingsystems and methods are specifically applicable to electrostatographicand/or xerographic machines and/or processes.

In addition, terms such as “right”, “left”, “vertical”, “horizontal”,“top”, “bottom”, “upper”, “lower”, “under”, “below”, “underlying”,“over”, “overlying”, “parallel”, “perpendicular”, etc., used herein areunderstood to be relative locations as they are oriented and illustratedin the drawings (unless otherwise indicated). Terms such as “touching”,“on”, “in direct contact”, “abutting”, “directly adjacent to”, etc.,mean that at least one element physically contacts another element(without other elements separating the described elements). Further, theterms automated or automatically mean that once a process is started (bya machine or a user), one or more machines perform the process withoutfurther input from any user.

It will be appreciated that the above-disclosed and other features andfunctions, or alternatives thereof, may be desirably combined into manyother different systems or applications. Various presently unforeseen orunanticipated alternatives, modifications, variations, or improvementstherein may be subsequently made by those skilled in the art which arealso intended to be encompassed by the following claims. Unlessspecifically defined in a specific claim itself, steps or components ofthe systems and methods herein cannot be implied or imported from anyabove example as limitations to any particular order, number, position,size, shape, angle, color, or material.

Therefore, disclosed above are an electronic documents processing methodand device wherein electronic documents are automatically created from abatch of physical documents of varying lengths. Specifically, the batchcomprises sub-batches of groups of multiple multi-page physicaldocuments of varying lengths with separator sheets on each group. Eachseparator sheet is marked with the page count of any physical documentscontained in its group. The batch is scanned-in and the resulting imagepages are used to form the electronic documents. Specifically, the imagepages are analyzed in sequence to identify document sheet image pagesand separator sheet image pages. When a separator sheet image page isidentified, the specific page count on that separator sheet image pageis read and electronic document(s) is/are created from sequential set(s)of document sheet image pages, which follow the separator sheet imagepage and have the specific page count, until the next separator sheetimage page is identified. This is repeated for all separator sheet imagepages identified.

It should be noted that the method and device described above eliminatesthe need to perform manual back end processing of electronic documents.It also eliminates the need to sort physical documents into discretebatches according to page length for scanning in purposes and/or theneed to insert separator sheets (i.e., slip sheets) in between eachphysical document when scanning in a batch of multi-length physicaldocuments. Thus, the method and device provide a significant time savingadvantage over previously used methods and devices for creatingelectronic documents from physical documents of varying lengths.

What is claimed is:
 1. A method comprising: receiving, by an electronicdocuments processor of a computerized device, multiple image pagesproduced by scanning a batch of media sheets, said batch comprising aplurality of sub-batches, said sub-batches comprising: groups ofphysical documents comprising document sheets, wherein at least some ofsaid groups comprise multiple physical documents and each of saidmultiple physical documents in a given group have a same page count; andseparator sheets on top of said groups, said separator sheets havingmarkings indicating page counts associated with said groups, and saidmultiple image pages comprising document sheet image pages correspondingto said document sheets and separator sheet image pages corresponding tosaid separator sheets; and forming, by said electronic documentsprocessor, multiple electronic documents corresponding to all physicaldocuments in said batch, said forming being performed by analyzing eachof said multiple image pages in sequence to identify page type and,during said analyzing, performing the following: when a separator sheetimage page is identified, reading a specific page count from a markingon said separator sheet image page; creating an electronic documenthaving said specific page count from each sequential set of documentsheet image pages following said separator sheet image page and havingsaid specific page count, said creating being performed until asubsequent separator sheet image page is identified; and repeating saidreading and said creating for all separator sheet image pagesidentified.
 2. The method of claim 1, said multiple physical documentseach comprising multi-page physical documents.
 3. The method of claim 1,said page counts being different on each of said separator sheets. 4.The method of claim 1, said markings being any one of pre-printedmarkings and handwritten markings.
 5. The method of claim 1, furthercomprising automatically communicating, by an output of saidcomputerized device, said multiple electronic documents to an electronicrecords system over a network connected to both said computerized deviceand said electronic records system.
 6. The method of claim 1, furthercomprising: storing, in a memory of said computerized device, saidmultiple electronic documents; receiving, by said computerized devicefrom a software agent of an electronic records system, a polling requestfor any new electronic documents; and after said receiving of saidpolling request, allowing, by said computerized device, said softwareagent to download said multiple electronic documents to said electronicrecords system.
 7. The method of claim 1, further comprising:automatically performing job integrity checks by performing any one ofthe following: verifying that each image page placed in a new electronicdocument is a document sheet image page; and determining that a numberof image pages between said separator sheet image page and saidsubsequent separator sheet image page is divisible by said specific pagecount.
 8. A method comprising: receiving, by an electronic documentsprocessor of a computerized device, multiple image pages produced byscanning a batch of media sheets using a single page template, saidbatch comprising a plurality of sub-batches, said sub-batchescomprising: groups of physical documents comprising document sheets,wherein at least some of said groups comprise multiple physicaldocuments and each of said multiple physical documents in a given grouphave a same page count; and separator sheets on top of said groups, saidseparator sheets having barcodes indicating page counts associated withsaid groups, and said multiple image pages comprising document sheetimage pages corresponding to said document sheets and separator sheetimage pages corresponding to said separator sheets; and forming, by saidelectronic documents processor, multiple electronic documentscorresponding to all physical documents in said batch, said formingbeing performed by analyzing each of said multiple image pages insequence, as said multiple image pages are received by said electronicdocuments processor, to identify page type and, during said analyzing,performing the following: when a separator sheet image page isidentified, reading a specific page count from a barcode on saidseparator sheet image page; creating an electronic document having saidspecific page count from each sequential set of document sheet imagepages identified following said separator sheet image page and havingsaid specific page count, said creating being performed until asubsequent separator sheet image page is identified; and repeating saidreading and said creating for all separator sheet image pagesidentified.
 9. The method of claim 8, said multiple physical documentseach comprising multi-page physical documents.
 10. The method of claim8, said page counts being different on each of said separator sheets.11. The method of claim 8, further comprising automaticallycommunicating, by an output of said computerized device, said multipleelectronic documents to an electronic records system over a networkconnected to both said computerized device and said electronic recordssystem.
 12. The method of claim 8, further comprising: storing, in amemory of said computerized device, said multiple electronic documents;receiving, by said computerized device from a software agent of anelectronic records system, a polling request for any new electronicdocuments; and after said receiving of said polling request, allowing,by said computerized device, said software agent to download saidmultiple electronic documents to said electronic records system.
 13. Themethod of claim 8, further comprising: automatically performing jobintegrity checks by performing any one of the following: verifying thateach image page placed in a new electronic document is a document sheetimage page; and determining that a number of image pages between saidseparator sheet image page and said subsequent separator sheet imagepage is divisible by said specific page count.
 14. A computerized devicecomprising: an input receiving multiple image pages; and an electronicdocuments processor operatively connected to said input, said multipleimage pages produced by scanning a batch of media sheets, said batchcomprising a plurality of sub-batches, said sub-batches comprising:groups of physical documents comprising document sheets, wherein atleast some of said groups comprise multiple physical documents and eachof said multiple physical documents in a given group have a same pagecount; and separator sheets on top of said groups, said separator sheetshaving markings indicating page counts associated with said groups, andsaid multiple image pages comprising document sheet image pagescorresponding to said document sheets and separator sheet image pagescorresponding to said separator sheets, said electronic documentsprocessor further forming multiple electronic documents corresponding toall physical documents in said batch by analyzing each of said multipleimages pages in sequence to identify page type and, during saidanalyzing, performing the following: when a separator sheet image pageis identified, reading a specific page count from a marking on saidseparator sheet image page; creating an electronic document having saidspecific page count from each sequential set of document sheet imagepages following said separator sheet image page and having said specificpage count, said creating being performed until a subsequent separatorsheet image page is identified; and repeating said reading and saidcreating for all separator sheet image pages identified.
 15. Thecomputerized device of claim 14, said multiple physical documents eachcomprising multi-page physical documents.
 16. The computerized device ofclaim 14, said page counts being different on each of said separatorsheets.
 17. The computerized device of claim 14, said markings being anyone of pre-printed markings and handwritten markings.
 18. Thecomputerized device of claim 14, further being connected by a network toan electronic records system and further comprising an outputautomatically communicating said multiple electronic documents to saidelectronic documents processor over said network.
 19. The computerizeddevice of claim 14, further comprising a memory, said electronicdocuments processor storing said multiple electronic documents in saidmemory, and said computerized device receiving a polling request for anynew electronic documents from a software agent of an electronic recordssystem and, after said receiving of said polling request, allowing saidsoftware agent to download said multiple electronic documents to saidelectronic records system.
 20. A computerized device comprising: aninput receiving multiple image pages; and an electronic documentsprocessor operatively connected to said input, said multiple image pagesproduced by scanning a batch of media sheets using a single pagetemplate, said batch comprising a plurality of sub-batches, saidsub-batches comprising: groups of physical documents comprising documentsheets, wherein at least some of said groups comprise multiple physicaldocuments and each of said multiple physical documents in a given grouphave a same page count; and separator sheets on top of said groups, saidseparator sheets having barcodes indicating page counts associated withsaid groups, and said multiple image pages comprising document sheetimage pages corresponding to said document sheets and separator sheetimage pages corresponding to said separator sheets, said electronicdocuments processor further forming multiple electronic documentscorresponding to all physical documents in said batch by analyzing eachof said multiple image pages in sequence, as said multiple image pagesare received by said electronic documents processor, to identify pagetype and, during said analyzing, performing the following: when aseparator sheet image page is identified, reading a specific page countfrom a barcode on said separator sheet image page; creating anelectronic document having said specific page count from each sequentialset of document sheet image pages identified following said separatorsheet image page and having said specific page count, said creatingbeing performed until a subsequent separator sheet image page isidentified; and repeating said reading and said creating for allseparator sheet image pages identified.
 21. The computerized device ofclaim 20, said multiple physical documents each comprising multi-pagephysical documents.
 22. The computerized device of claim 20, said pagecounts being different on each of said separator sheets.
 23. Thecomputerized device of claim 20, further being connected by a network toan electronic records system and further comprising an outputautomatically communicating said multiple electronic documents to saidelectronic documents processor over said network.
 24. The computerizeddevice of claim 20, further comprising a memory, said electronicdocuments processor storing said multiple electronic documents in saidmemory, and said computerized device receiving a polling request for anynew electronic documents from a software agent of an electronic recordssystem and, after said receiving of said polling request, allowing saidsoftware agent to download said multiple electronic documents to saidelectronic records system.